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ABSTRACT 


For the IT sector and software specialists, software failure prediction 
and proneness have long been seen as crucial issues. Conventional 
methods need prior knowledge of errors or malfunctioning modules 
in order to identify software flaws inside an application. By using 
machine learning approaches, automated software fault recovery 
models allow the program to substantially forecast and recover from 
software problems. This feature helps the program operate more 
efficiently and lowers errors, time, and expense. Using machine 
learning methods, a software fault prediction development model was 
presented, which might allow the program to continue working on its 
intended mission. Additionally, we assessed the model's performance 
using a variety of optimization assessment benchmarks, including 
accuracy, fl-measure, precision, recall, and _ specificity. 
Convolutional neural networks and its hyperbolic tangent functions 
are the basis of the deep learning prediction model FPRNN-HTF 
(Forward Pass RNN with Hyperbolic Tangent Function) technique. 
The assessment procedure demonstrated the high accuracy rate and 
effective application of CNN algorithms. Moreover, a comparative 
measure is used to evaluate the suggested prediction model against 
other methodologies. The gathered data demonstrated the superior 


How to cite this paper: Swati Rai | Dr. 
Kirti Jain "Study of Software Defect 
Prediction using Forward Pass RNN 
with Hyperbolic Tangent Function" 


Published in = 
International | li 

Journal of Trend in Saige 
Scientific Research oH = 

and Development ppl 
(ijtsrd), ISSN:  Wlatbsheat 
2456-6470, IJTSRD60159 


Volume-7 | Issue-6, 
December 2023, pp.268-273, URL: 
www.ijtsrd.com/papers/1jtsrd60159.pdf 


Copyright © 2023 by author (s) and 
International Journal of Trend in 
Scientific Research and Development 
Journal. This is an 

Open Access article 
distributed under the aa 


terms of the Creative Commons 
Attribution License (CC BY 4.0) 


performance of the FPRNN-HTF technique. 


(http://creativecommons.org/licenses/by/4.0) 


KEYWORDS: FPRNN-HTF (Forward Pass RNN with Hyperbolic 
Tangent Function), precision, recall, specificity, Fl-measure, and 


accuracy 


1. INTRODUCTION 

The presence of flaws in software significantly 
impacts its dependability, quality, and upkeep 
expenses. Even with diligent application, it might be 
difficult to get bug-free software since most defects 
are buried. A significant issue in software engineering 
is also creating a software bug prediction model that 
might identify problematic modules early on. 
Predicting software bugs is a crucial task in software 
development. This is so that user happiness and 
overall program performance may be increased by 
anticipating the problematic modules before software 
is deployed. Additionally, early software problem 
prediction enhances software adaptability to various 
situations and maximizes resource efficiency. 


Numerous research have been conducted on the 
prediction of software bugs with machine learning 
methods. Take the linear Auto-Regression (AR) 
technique, for instance, to forecast the defective 


modules. Based on previous data on software 
accumulated flaws, the research forecasts future 
software errors. The research also used the Root 
Mean Square Error (RMSE) method to assess and 
compare the AR model with the Known Power Model 
(POWM). Three datasets were also included in the 
research for assessment, and the outcomes looked 
good. The research examined the suitability of several 
machine learning techniques for defect prediction. 
The key earlier studies on each machine learning 
approach and the most recent developments in 
machine learning-based software bug prediction. 


2. BACKGROUND 

Robotic programming deformity expectation (SDP) 
tactics are gradually used, sometimes with the use of 
artificial intelligence (AI) processes, according to 
Gorkem Giray et al. [1]. But existing machine learning 
methods need physically removed highlights, which 
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are cumbersome, time-consuming, and only partially 
capture the semantic information disclosed in bug 
describing equipment. Professionals have the 
invaluable opportunity to extract and benefit from 
more complex and multi-layered knowledge as a 
result of profound learning (DL) techniques. 


According to Iqra Batool et al. [2], programming 
engineers may identify problematic builds-like 
modules or classes-early in the product advancement 
life cycle with the use of programming 
issue/deformity expectation. Information mining, 
artificial intelligence, and deep learning techniques are 
used to program expectations that are not satisfied. 


Heterogeneous deformity expectation (HDP), as 
introduced by Haowen Chen et al. [3], refers to the 
imperfection prediction amongst projects with 
different measurements. The majority of current HDP 
methods severely limit their interpretability by 
mapping source and target data into a conventional 
measurement space where each element has no real 
significance. Furthermore, HDP often faces the 
problem of class inequality. 


According to Cagatay Catal et al. [4], phishing attacks 
aim to steal personal information by using 
sophisticated techniques, tools, and tactics. Some 
examples of these are happy infusion, social 
engineering, online forums, and mobile apps. A 
number of phishing location techniques were 
developed in order to prevent and lessen the risks of 
these attacks; deep learning calculations proved to be 
one of the most effective. 


Xieling and others [5], A number of open-source and 
endeavor-supported information diagrams have 
emerged in recent years, marking a remarkable 
advancement in the application of information 
portrayal and thinking into a variety of domains, 
including computer vision and natural language 
processing. The goal of this research is to thoroughly 
examine the current state and trends of information 
diagrams, with a focus on the topical examination 
structure. 


Cagatay et al. (2006) Innovative techniques are put 
forward for identifying and eliminating the various 
types of malware, with deep learning computations 
playing a crucial role. Even while the development of 
DL-based portable malware detection techniques has 
received a great deal of attention, it hasn't been 
thoroughly examined yet. The objective of this effort 
is to identify, compile, and review the published 
publications related to the use of deep learning 
techniques to the detection of portable malware. 


One of the major challenges in programming 
advancement and programming language research for 


further enhancing programming quality and 
dependability is ality expectation (Akimova et al., 
Deform et al., 2007). The problem in this area is to 
accurately and very precisely identify the corrupted 
source code. Developing a prediction model with 
shortcomings is a challenging problem for which 
several approaches have been put out throughout 
history. 


3. PROBLEM IDENTIFICATION 

It is typical development practice to verify and 
examine source codes using analytical techniques. 
This procedure may be carried either automatically or 
manually with the use of tools for dynamic and static 
code analysis, among other things. Static code 
analysis has seen a recent surge in tool development, 
offering really useful, added-value solutions to many 
of the issues that software development companies 
encounter. However, these techniques are difficult to 
utilize in real-world scenarios due to a large number 
of false positive and false negative outcomes. 
Therefore, another technique or approach-such as 
Machine Learning (ML) algorithms-must be 
discovered for static code analysis. 


The problems that have been identified based on 

previous studies are listed below: 

> It is not always possible to identify relevant 
software flaws. 

> A software bug's recovery is not entirely 
recognized. 

> Because of its poor precision, the unnamed 
software problem may be detected. 


4. RESEARCH OBJECTIVES 

The objectives of the proposed work: 

> To increase accuracy for flawless software bug 
retrieval. 

> To increase recall for software faults that are 
absolutely applicable throughout the retrieval 
process. 

> To increase the precision of software bug 
detection. 


5. METHODOLOGY 

The Algorithm of proposed methodology FPRNN- 
HTF (Forward Pass RNN with Hyperbolic Tangent 
Function) is as follows 


I = Number of input layers 

H = Number of hidden layers 

O = Number of output layers 

S = Number of data set instances 
Step 1: fori=1toH 

Step 2: forj=1toS 


@ IJTSRD | Unique Paper ID -IJTSRD60159 | Volume—7 | Issue—6 | Nov-Dec 2023 


Page 269 


International Journal of Trend in Scientific Research and Development @ www..ijtsrd.com eISSN: 2456-6470 


calculating the forward for the forward hidden layers 
with activation function 


pe =tanh (w hi. cr wi a+ bi ) 
end for 
Step 3: for j=S to 1 


calculating the backward pass for the backward 
hidden layer’s activation function 


h?=tanh (W;’h?, +W)x, +b} ) 
end for 

end for 

Step 4: fori =1 to O 


calculating the forward pass for the output layer using 
the previous stored activation function 


P(y, 


Wy is the weight matrix connecting the hidden layer 
to output layer, 


{x},,, }=o(Wy a! +W?n? +b, ) 


Wn is the weight matrix that connects hidden to 
hidden layer, 


and W, is the weight matrix that connects input layer 
to hidden layer. 


by is the output layer bias vectors, and bn is the hidden 
layer bias vectors. 


For the final nonlinearity r, and use tanh as an 
activation function for classification. According to 
this form, the RNN will evaluate the output y; 
according to the information propagated through the 
hidden layer regardless of whether it depends directly 


t 
i= 


or indirectly on the values {x,}._, ={%.%).-.%,}- 


= jupyter solt def prediction{SVM) juneavad eangas! @ w 


Figure 2: Complexity Evaluation of Bug 
Frequency for FPRNN-HTF (Proposed 
Prediction Model) 
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Figure 3: Calculation of confusion matrix, 
precision, recall, F1-Score and accuracy among 
different models and FPRNN-HTF (Proposed 
Prediction Model) 


Table 1: Estimation of Precision, Recall, F1- 
Score and Accuracy among different models and 
FPRNN-HTF (Proposed Prediction Model) 


end for ne F1- 
Models Precision Recall S Accuracy 
end for core 
: Random 
The Architecture of proposed methodology FPRNN- Parest 0.9 0.83 | 0.9 | 80.65 % 
HTF (Forward Pass RNN with Hyperbolic Tangent Naive 
Function) is as follows Bayes 0.94 0.82 | 0.88 | 80.10 % 
sug = Logistic | 993 | 08 | 0.88 | 79.5% 
petaset LS) SON La loveprocessing Lets] reprocessed Ln] Classification Lol ae Regression 
eu 0.83 | 0.84 | 0.83 | 74.86 % 
Figure 1: Process of proposed work ANN 0.92 0.34 | 0.9 | 82.77% 
6. RESULTS AND ANALYSIS FPRNN- 
Python 3.11.1 on Anaconda Navigator and a Jupyter HTF 0.95 0.96 | 0.98 | 96 % 
notebook are used to take the following metrics. The (Proposed) 
computation of precision, recall, Fl-Score, and 
accuracy is determined by using the recommended 
FPRNN-HTF method on CS1.csv data from the 
PROMISE dataset. 
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Figure 4: Graphical Analysis of Precision among 
different models and FPRNN-HTF (Proposed 
Prediction Model) 


When compared to alternative models in the context 
of bug prediction, the following graphic shows that 
the recommended model offers higher accuracy. In 
terms of accuracy, FPRNN-HTF beats Naive Bayes 
by a margin of 0.01. 
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Figure 5: Graphical Analysis of Recall among 
different models and FPRNN-HTF (Proposed 

Prediction Model) 


The graph above illustrates how the proposed model 
outperforms earlier models in terms of recall for bug 
prediction. FPRNN-HTF has a 0.12 improvement in 
recall over the Decision Tree and ANN prediction 
models. 
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Figure 6: Graphical Analysis of F1-Score among 
different models and FPRNN-HTF (Proposed 
Prediction Model) 


The graph above illustrates how the proposed model's 
Fl-score is greater than that of earlier models. In 


terms of Fl-score, FPRNN-HTF is superior than 
Random Forest and ANN by 0.08 points. 
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Figure 7: Graphical Analysis of Accuracy among 
different models and FPRNN-HTF (Proposed 
Prediction Model) 


When compared to current models, the following data 
shows that the recommended model offers a greater 
accuracy for predicting bugs. The accuracy of 
FPRNN-HITF prediction model is 13.23% greater 
than that of ANN prediction model. 


7. CONCLUSION 

We have used the FPRNN-HTF model in this 
investigation to get the intended findings. Our 
research demonstrates that prior attempts did not pay 
enough attention to feature selection and cross 
validation. The recommended technique outperforms 
other ones in terms of accuracy (98.16%) on large 
datasets. The combination of the five developed 
approaches yields the best results since it is 
computationally demanding (it avoids overfitting and 
gives fast prediction speeds) and versatile in 
application (it can be used for both regression and 
classification problems). Investigating this method 
further for bug prediction in deep learning models has 
been a continuous endeavor. 


There should be conclusions to these: 

1. The accuracy of the proposed model is higher 
than that of FPRNN-HTF. The accuracy has 
increased by 0.01 compared to Naive Bayes. 


2. The proposed model achieves higher recall than 
FPRNN-HTF Regression. FPRNN-HTF has a 
0.12 improvement in recall over the Decision 
Tree and ANN prediction models. 


3. In terms of Fl-Score, the suggested model 
performs better than the FPRNN-HTF. The 
difference between Random Forest and ANN is 
0.08. 

4. The proposed model has a greater accuracy in 


comparison to ANN. There is an accuracy 
increase of 13.23%. 
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Thus, for software bug prediction, FPRNN-HTF 
(Forward Pass RNN with Hyperbolic Tangent 
Function) is a more accurate method. 


We propose a technique that enhances diagnostic 


precision-a 


critical component of successful 


treatment. New datasets should be used to evaluate 
the accuracy in the future, and further AI approaches 
should be used to confirm the correctness of the 
estimate. Owing to the enormous amount of data 
required for train data performance estimate, the 
suggested model has a processing time limit. The 
effectiveness of the system will be estimated in the 
future using real-time data and the same algorithms. 
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